NSF PAR Search | NSF Public Access Repository

Using Text-Based Causal Inference to Disentangle Factors Influencing Online Review Ratings

https://doi.org/10.18653/v1/2025.naacl-long.562

Li, Linsen; Culotta, Aron; Mattei, Nicholas (January 2025, Association for Computational Linguistics)

Online reviews provide valuable insights into the perceived quality of facets of a product or service. While aspect-based sentiment analysis has focused on extracting these facets from reviews, there is less work understanding the impact of each aspect on overall perception. This is particularly challenging given correlations among aspects, making it difficult to isolate the effects of each. This paper introduces a methodology based on recent advances in text-based causal analysis, specifically CausalBERT, to disentangle the effect of each factor on overall review ratings. We enhance CausalBERT with three key improvements: temperature scaling for better calibrated treatment assignment estimates; hyperparameter optimization to reduce confound overadjustment; and interpretability methods to characterize discovered confounds. In this work, we treat the textual mentions in reviews as proxies for real-world attributes. We validate our approach on real and semi-synthetic data from over 600K reviews of U.S. K-12 schools. We find that the proposed enhancements result in more reliable estimates, and that perception of school administration and performance on benchmarks are significant drivers of overall school ratings.
Free, publicly-accessible full text available January 1, 2026
AI Can Be a Powerful Social Innovation for Public Health if Community Engagement Is at the Core

https://doi.org/10.2196/68198

Bazzano, Alessandra N; Mantsios, Andrea; Mattei, Nicholas; Kosorok, Michael R; Culotta, Aron (January 2025, Journal of Medical Internet Research)

There is a critical need for community engagement in the process of adopting artificial intelligence (AI) technologies in public health. Public health practitioners and researchers have historically innovated in areas like vaccination and sanitation but have been slower in adopting emerging technologies such as generative AI. However, with increasingly complex funding, programming, and research requirements, the field now faces a pivotal moment to enhance its agility and responsiveness to evolving health challenges. Participatory methods and community engagement are key components of many current public health programs and research. The field of public health is well positioned to ensure community engagement is part of AI technologies applied to population health issues. Without such engagement, the adoption of these technologies in public health may exclude significant portions of the population, particularly those with the fewest resources, with the potential to exacerbate health inequities. Risks to privacy and perpetuation of bias are more likely to be avoided if AI technologies in public health are designed with knowledge of community engagement, existing health disparities, and strategies for improving equity. This viewpoint proposes a multifaceted approach to ensure safer and more effective integration of AI in public health with the following call to action: (1) include the basics of AI technology in public health training and professional development; (2) use a community engagement approach to co-design AI technologies in public health; and (3) introduce governance and best practice mechanisms that can guide the use of AI in public health to prevent or mitigate potential harms. 
These actions will support the application of AI to varied public health domains through a framework for more transparent, responsive, and equitable use of this evolving technology, augmenting the work of public health practitioners and researchers to improve health outcomes while minimizing risks and unintended consequences.
Free, publicly-accessible full text available January 1, 2026
PACE: Participatory AI for Community Engagement

https://doi.org/10.1609/hcomp.v12i1.31610

Hassan, Saad; Asad, Syeda Mah Noor; Eslami, Motahhare; Mattei, Nicholas; Culotta, Aron; Zimmerman, John (October 2024, Proceedings of the AAAI Conference on Human Computation and Crowdsourcing)

The public sector leverages artificial intelligence (AI) to enhance the efficiency, transparency, and accountability of civic operations and public services. This includes initiatives such as predictive waste management, facial recognition for identification, and advanced tools in the criminal justice system. While public-sector AI can improve efficiency and accountability, it also has the potential to perpetuate biases, infringe on privacy, and marginalize vulnerable groups. Responsible AI (RAI) research aims to address these concerns by focusing on fairness and equity through participatory AI. We invite researchers, community members, and public sector workers to collaborate on designing, developing, and deploying RAI systems that enhance public sector accountability and transparency. Key topics include raising awareness of AI's impact on the public sector, improving access to AI auditing tools, building public engagement capacity, fostering early community involvement to align AI innovations with public needs, and promoting accessible and inclusive participation in AI development. The workshop will feature two keynotes, two short paper sessions, and three discussion-oriented activities. Our goal is to create a platform for exchanging ideas and developing strategies to design community-engaged RAI systems while mitigating the potential harms of AI and maximizing its benefits in the public sector.
Full Text Available
Predicting hurricane evacuation behavior synthesizing data from travel surveys and social media

https://doi.org/10.1016/j.trc.2024.104753

Bhowmik, Tanmoy; Eluru, Naveen; Hasan, Samiul; Culotta, Aron; Roy, Kamol Chandra (August 2024, Transportation Research Part C: Emerging Technologies)

Full Text Available
Use Open Source for Safer Generative AI Experiments

Culotta, Aron; Mattei, Nicholas (January 2024, MIT Sloan Management Review)

Commercial AI services can put proprietary data at risk — but there are alternatives.
Full Text Available
Identifying Hurricane Evacuation Intent on Twitter

https://doi.org/10.17603/ds2-x9kt-jy53

Culotta, Aron; Hasan, Samiul (October 2023, Designsafe-CI)

Evacuations have a significant impact on saving human lives during hurricanes. However, as a complex dynamic process, it is typically difficult to know individual evacuation decisions in real-time. Since a large amount of information is continuously posted through social media platforms, we can use them to understand individual evacuation behavior. In this paper, we collect tweets during Hurricane Irma in 2017 and train a text classifier in an active learning way to distinguish tweets expressing positive evacuation decisions from both negative and irrelevant ones. Additionally, we perform a demographic analysis and content clustering to investigate the potential causes and correlates of evacuation decisions. The results can be used to help inform planning strategies of emergency response agencies.
Forecasting COVID-19 Vaccination Rates using Social Media Data

https://doi.org/10.1145/3543873.3587639

Li, Xintian; Culotta, Aron (April 2023, WWW '23 Companion: Companion Proceedings of the ACM Web Conference 2023)

The COVID-19 pandemic has had a profound impact on the global community, and vaccination has been recognized as a crucial intervention. To gain insight into public perceptions of COVID-19 vaccines, survey studies and the analysis of social media platforms have been conducted. However, existing methods lack consideration of individual vaccination intentions or status and the relationship between public perceptions and actual vaccine uptake. To address these limitations, this study proposes a text classification approach to identify tweets indicating a user’s intent or status on vaccination. A comparative analysis between the proportions of tweets from different categories and real-world vaccination data reveals notable alignment, suggesting that tweets may serve as a precursor to actual vaccination status. Further, regression analysis and time series forecasting were performed to explore the potential of tweet data, demonstrating the significance of incorporating tweet data in predicting future vaccination status. Finally, clustering was applied to the tweet sets with positive and negative labels to gain insights into underlying focuses of each stance.
Full Text Available
Reducing Cross-Topic Political Homogenization in Content-Based News Recommendation

https://doi.org/10.1145/3523227.3546782

Shivaram, Karthik; Liu, Ping; Shapiro, Matthew; Bilgic, Mustafa; Culotta, Aron (September 2022, RecSys '22: Proceedings of the 16th ACM Conference on Recommender Systems)

Content-based news recommenders learn words that correlate with user engagement and recommend articles accordingly. This can be problematic for users with diverse political preferences by topic (e.g., users that prefer conservative articles on one topic but liberal articles on another). In such instances, recommenders can have a homogenizing effect by recommending articles with the same political lean on both topics, particularly if both topics share salient, politically polarized terms like "far right" or "radical left." In this paper, we propose attention-based neural network models to reduce this homogenization effect by increasing attention on words that are topic specific while decreasing attention on polarized, topic-general terms. We find that the proposed approach results in more accurate recommendations for simulated users with such diverse preferences.
Full Text Available
Identifying Hurricane Evacuation Intent on Twitter

https://doi.org/10.1609/icwsm.v16i1.19320

Li, Xintian; Hasan, Samiul; Culotta, Aron (June 2022, Proceedings of the International AAAI Conference on Web and Social Media)

Evacuations have a significant impact on saving human lives during hurricanes. However, as a complex dynamic process, it is typically difficult to know individual evacuation decisions in real-time. Since a large amount of information is continuously posted through social media platforms, we can use them to understand individual evacuation behavior. In this paper, we collect tweets during Hurricane Irma in 2017 and train a text classifier in an active learning way to distinguish tweets expressing positive evacuation decisions from both negative and irrelevant ones. Additionally, we perform a demographic analysis and content clustering to investigate the potential causes and correlates of evacuation decisions. The results can be used to help inform planning strategies of emergency response agencies.
Full Text Available
Predicting traffic demand during hurricane evacuation using Real-time data from transportation systems and social media

https://doi.org/10.1016/j.trc.2021.103339

Roy, Kamol Chandra; Hasan, Samiul; Culotta, Aron; Eluru, Naveen (October 2021, Transportation Research Part C: Emerging Technologies)
Full Text Available
